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METHOD AND SYSTEM FOR TRANSFERRING 
CONTENT TO A NETWORKED UNIT 



TECHNICAL FIELD OF THE INVENTION 

The present invention generally relates to networked information 
distribution systems, and in particular, to a method and system for distributing 

10 digitized content, such as images, videos, or other files, to end users for real-time 

presentation. 



BACKGROUND OF THE INVENTION 

Many existing communication networks do not provide sufficient 

15 bandwidth to support real-time or near real-time presentation of content, such as 
videos, images or other large media files. The telephone network has been 
suggested as a means for transporting real-time content. However, systems 
using the public switched telephone network (PSTN) are often bandwidth limited, 
providing low quality presentation and services for content such as real-time 

20 video. 

U.S. Pat. No. 5,410,343, assigned to Bell Atlantic Network Services, Inc., 
discloses a video-on-demand (VOD) system which relies upon telephone lines 
for distribution. Specifically, the system disclosed by the £ 343 patent uses 
asymmetric digital subscriber line (ADSL) channels, together with the public 
25 switched telephone network, for delivery of video program material. Generally, 
ADSL channels are bi-directional digital links having a high bandwidth 
downstream link and a low bandwidth upstream link. The advantage of ADSL is 
that higher bandwidth signals can be transmitted on ordinary telephone lines to 
end-user premises and additional cable or wiring is not necessary. 

30 
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Although the system described in the ' 343 patent represents a significant 
improvement in the distribution of video programs, the system itself does not 
have sufficient downstream bandwidth to support high-quality VOD services, 
5 similar to those provided by digital broadcast service (DBS) providers. The 
downstream ADSL channel described in the ' 343 patent has a bandwidth of 
1.544 Mbps, whereas contemporary digital cable service providers offer VOD 
services at bit rates ranging between 3-6 Mbps. The relatively limited bandwidth 
of the ADSL system does not permit real-time content at as high of quality as the 

10 digital cable and satellite broadcast systems. 

U.S. Patent No. 5,790,935, assigned to Hughes Aircraft Company, 
discloses a satellite-based digital broadcast system that provides virtual VOD 
services to subscribers. To reduce peak bandwidth demand, the system 
described in the '935 patent selectively downloads and caches digital files, such 

15 as videos, to subscriber units during off-peak hours. The disclosed system 

potentially improves the responsiveness of on-demand services by making some 
content locally available at subscriber units, without the need to access network 
resources. However, the '935 system requires content files to be stored in their 
entirety at the subscriber units. For large content files, such as videos, this 

20 requires larger storage devices in the subscriber units, and thus, increases the 
cost of subscriber units. Moreover, the '935 system does not utilize pre-existing 
network infrastructure, such as the PSTN, and instead relies on a wireless 
satellite communication network dedicated to video broadcast. 

Accordingly, there is a need for an improved transfer system and method 

25 that permits the transport of real-time content over pre-existing networks, such as 
the PSTN. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram illustrating a content transfer scheme in 
accordance with the present invention; 
5 FIG. 2 is a block diagram illustrating details of the client shown in FIG. 1 ; 

FIG. 3 is a conceptual diagram illustrating one approach to decomposing 
content; 

FIG. 4 is a conceptual diagram illustrating a pyramidal scheme for 

decomposing content; 
10 FIG. 5 is a diagram illustrating a DSL system in accordance with an 

exemplary embodiment of the present invention; 

FIG. 6 is a graph illustrating an exemplary ADSL spectrum; 

FIG. 7 is a flowchart illustrating operation of the central office shown in 

FIG. 5; 

1 5 FIG. 8 is a flowchart illustrating operation of the set-top box shown in 

FIG. 5; 

FIG. 9 is a process chart illustrating a DCT algorithm for compressing 
content; 

FIG. 10 is a process diagram illustrating the decomposition of content 
20 using an MPEG-2 compression algorithm; 

FIG. 11 is an exemplary DCT coefficient map; 

FIG. 12 is a graph showing an exemplary signal spectrum for a sub-band 

coding algorithm; 

FIG. 13 is a decomposition map of an image frame that has been 

25 decomposed into seven sub-bands; and 

FIG. 14 is a block diagram illustrating vector quantization processes for 
encoding and decoding content. 
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DETAILED DESCRIPTION OF THE 
PRESENTLY PREFERRED EMBODIMENTS 

It is an advantage of the present invention to provide an improved system 
5 and method for distributing content over a network to end users. According to 
one embodiment of the invention, content files are separated into portions at a 
content repository. Each of the files can be separated as a function of a 
compression algorithm. At least one of the portions of a file is then downloaded 
to a user terminal for storage therein. When the content is selected for 
1 0 presentation at the user terminal, one or more other portions of the file are then 
streamed in real-time to the user terminal for recombination with the locally 
stored portion. The recombined content is then presented to the end user. 

The present invention is applicable in many networked systems where 
bandwidth limitations negatively impact the quality of services provided to users. 
1 5 For example, the invention can be employed to transport content over the 

Internet, private intranets, the PSTN, wireless telecommunication networks, and 
the like. 

In particular, the invention can significantly improve services requiring the 
transport of video over ADSL networks. It is difficult to offer a video-on-demand 
20 (VOD) service at high quality over ADSL networks having an industry standard 
downstream bandwidth of 1 .544 Mbps. Because of the bandwidth limitations of 
standard ADSL networks, real-time video quality is typically less than the quality 

provided by satellite-based digital broadcast systems (DBSs) and cable 
operators. Currently, DBS providers and digital cable providers offer high quality 
25 VOD services at bit rates ranging between 3-6 Mbps. By employing the present 
invention, service providers could offer higher quality VOD services over ADSL 
networks at 1 .544 Mbps. 
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Turning now to the drawings, and in particular FIG. 1 , there is illustrated a 
block diagram of an exemplary content transfer system 20 in accordance with the 
present invention. Within the system 20, a content repository 22 stores low 
5 quality (LQ) and high quality (AQ) portions 24-26 of content files. The portions 
24-26 are transferable to a client 28 over a network 30, such as an asymmetrical 
digital subscriber line (ADSL) network. 

The content can be any type of information that can be represented in a 
computer-usable format and transported over a communications network, such 
1 0 as video, image, audio, data or the like. Preferably, the content includes digitized 
video files representing movies. In accordance with the invention, content files 
«s are decomposed into LQ parts 24 and AQ parts 26 using a compression 

S technique such as wavelets, sub-band coding, discrete cosine transform (DCT), 

p vector quantization (VQ), or other progressive compression methods. 

% 15 The system 20 can use partial, pyramid progressive compression 

P techniques to provide VOD services. In progressive compression techniques, 

r compressed information is transmitted in a progressive manner, allowing the 

J receiver side (i.e., client 28) to reconstruct a lower quality version of the image 

II first, and then as it receives more information from the transmit side, the quality is 

□ 20 enhanced progressively. 

* The repository 22 downloads LQ or AQ portions of one or more of content 

files for local storage at the client 28, prior to viewing by the end user. In 
response to user selections made at the client 28, the remaining part(s) of the 
content files can be sent in real-time from the repository 24 to the client 28. The 

25 advantage of this method over downloading entire files into the client 28 is that a 
greater number of files can be made available to the client 28 without increasing 
the storage size in the client 28, thus offering a larger selection of content at a 
potentially higher quality. Another benefit is related to network congestion. In 
case of network congestion where a real-time AQ download is disrupted or 

30 unavailable, the LQ part of the content is available to the client and the content 
file can be presented at a reduced quality level. 
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ln terms of implementation, the system 20 does not require expensive 
client and codec design. For video content, the client 28 is able to recompose 
each video frame from LQ and AQ and then perform the task of decompressing 
5 the video frames. Currently, many commercially-available digital set-top boxes 
are capable of decompressing DCT-based compressed video frames. 

FIG. 2 is a block diagram illustrating details of the client 28 shown in 
FIG. 1 . The client 28 can be a set-top box, personal computer, a wireless digital 
device, such as a personal digital assistant, or any other suitable computing 
10 device for performing the client functions disclosed herein. In the exemplary 

architecture shown, the client 28 includes a storage device 50, a AQ buffer 52, an 
LQ buffer 54, a re-composition device 55, a frame buffer 56 and a video decoder 
58. 

In operation for viewing digitized movie videos, the LQ parts of n movies 
1 5 are downloaded to the client's storage device 50 using the ADSL network 30. 

The LQ parts can be downloaded during off-peak hours. The number n depends 
on the capacity of the storage device 50 in the client 28. When a user selects to 
view the movie "i", where i is a number between 1 and n, the client 28 sends a 
request to the repository 22 and asks for the AQ information corresponding to the 
20 movie "i". The client 28 starts receiving the AQ information in real-time and after 
recomposing the frames, it decompresses the video and sends it to a TV monitor 
for viewing. 

The movie is recomposed frame by frame. To accomplish this, each 
digitized AQ frame j received in real-time is temporarily stored in the buffer 52, 

25 where j is a number. The corresponding jth LQ frame is retrieved from the 
storage device 50 and held in buffer 54. The buffered frames are then 
recomposed by the re-composition device 55, which in turn stores the 
recomposed frame in the buffer 56. Recomposed frames are then provided to 
the video decoder 58, which outputs the decoded video for presentation by a 

30 suitable display device, such as a computer or TV monitor. 
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The buffers 52-56 and storage device 50 can be any suitable memory 
devices for storing digitized information. The re-composition device can be 
implemented using a programmable processor, such as a digital signal processor 
5 (DSP) or other microprocessor, that executes software code for recombining the 
LQ and AQ frames in the manner described herein. Alternatively, the 
re-composition device 55, buffers 52-56 and storage device 50, can be 
implemented using one or more application specific integrated circuits (ASICs), 
or any suitable combination of standard hardware components, ASICs, and 

1 0 programmable devices. 

The video decoder 58 can be a commercially-available video codec. 
FIG. 3 is a conceptual diagram illustrating one approach to decomposing 

content. As mentioned above, each content file is compressed and divided into a 
LQ and AQ. The LQ part of the compressed content corresponds to the basic 

15 core of the content. Decompressing and decoding LQ reproduces a lower quality 
version of the content. When adding AQ to LQ and decompressing and decoding 
the content, the quality is improved significantly. 

Because in some network systems, such as ADSL networks, the 
bandwidth available to end-users varies, depending on factors such as the 

20 end-users 1 distance from the central office, the approach shown in FIG. 3 
consists of storing a single LQ for each content file and different AQs for the 
same LQ in the content repository 22. At the content creation phase, the 
different AQs are optimized for different bandwidths. Each combination of LQ 
and AQs produces a different content quality, where i = (a,b,c,d). FIG. 3 shows 

25 customers with different bandwidths being offered the same content at different 
bit rates. This approach enables a service provider to provide different levels of 
service. 
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Forthe example file 100 shown in FIG. 3, a single LQ portion is provided 
by the repository, along with four AQ portions: AQ a , AQ b , AQ C , and AQ d . Each of 
the AQ portions can be independently combined with the LQ portion to produce a 

5 final content file having a different quality level to satisfy particular bandwidth. In 
the example shown, AQ a can be combined with LQ for use with a 1 .5 Mbps 
network bandwidth, AQ b can be combined with LQ for use with a 3 Mbps 
bandwidth, AQ C can be combined with LQ for a 4.5 Mbps bandwidth, and AQ d can 
be combined for use with a 6 Mbps bandwidth. 

1 0 The following description provides examples of how two different 

end-users with two different bandwidths are provided a VOD service with the 

approach of FIG. 3. 

For an end user with a bandwidth of 1 .5 Mbps, the end-user selects a 
content file from a list of available files for which the LQs are already downloaded 

1 5 to the client 28. After the selection is made the repository 22 is alerted of the 
end-user's choice and a server selects the AQ a that is optimized for users with a 
bandwidth of 1 .5 Mbps and starts streaming the AQ a content to the client 28. 
The client 28 receives the AQ a information and combines it with the LQ 
information that is already stored in the client and then decodes the re-combined 

20 content and displays it on a display device. 

For an end user with a bandwidth of 3 Mbps, after the content selection 
process by the end user, the repository server selects AQ b that is optimized for 
the 3 Mbps network bandwidth and starts streaming the AQ b content to the client 
28. The client 28 receives the AQ b information and combines it with the LQ 

25 information that is already stored in the client and then decodes the re-combined 
content and displays it on a display device. 
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FIG. 5 is a conceptual diagram illustrating a second approach to 
decomposing content. This approach relies on a pyramidal scheme 1 10 for 
decomposing content. Under this scheme, each compressed content file is 
5 decomposed into an LQ and a AQ that is subdivided into smaller sections: 



n 



AQ= I q. 

1 1 ( 1 ) 
1 = 1 



The combined compressed content is: 

Compressed Content = e(lQ + £q.) ( 2 ) 

The LQ and q\ portions are created and stored in the repository 22. 
1 0 Depending of the network bandwidth available, the number n changes. 

For example, for a bandwidth of 1 .5Mbps, n may be 5 and for a 3 Mbps 
bandwidth n may be 10. As with the first approach discussed in connection with 
FIG.3, the LQ portion is stored in the client 28. 

For the 1 .5 Mbps network, the q^ q 2 , q3, q4, and q 5 portions are 
15 downloaded in real-time. At the client 28, the LQ and q portions are re- 
composed and decoded so that the output content is: LQ + qi + q2 + q3 + Q4 + Qs- 

For the 3 Mbps network, the qi, q 2 , - . . , q9, and qi 0 portions are 
downloaded in real-time. At the client 28, the LQ and q portions are re- 
composed and decoded so that the output content is: LQ + q-i + q 2 ■ • • + qg+ q-io- 
20 FIG. 5 is a diagram illustrating an exemplary digital subscriber line (DSL) 

system 200 in accordance with an embodiment of the present invention. The 
example network architecture is capable of streaming video content from a 
central office (CO) 202 to customer premises. The system 200 includes the CO 
202 communicating with a customer premise 204 using an ADSL link 206. 
25 Although not shown, the system 200 can support multiple customer premises 
and ADSL links to the CO 202. 
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The CO 202 includes a splitter 214, a digital subscriber line access 
multiplexer (DSLAM) 216, a voice switch 222, an asynchronous transfer mode 
(ATM) switch 218, a content server 220 and storage device 226, and a content 

5 repository 221. 

The CO 202 can communicate with a data network 212, such as the 

Internet, through the ATM switch 218. The voice switch 222 can be a 
conventional telecommunications switch that permits the CO 202 to 
communicate with a voice network 208, such as the plain old telephone system 
10 (POTS). 

The customer premise 204 includes a splitter 240, an ADSL modem 242, 
an Ethernet hub/switch 246, a set-top box 248, a display device 250, such as a 
a TV, a personal computer (PC) 252, and a telephone 254. 

% DSL is a high-speed, dedicated, point-to-point network technology for 

1 5 delivering voice, video and data at speeds much faster than the analog modem 
m technology. DSL services are designed for the local loop, or "last mile" copper 

m from a central office (CO) to an end user's business or home. End users closer 

Q to the central office can be served with higher bandwidth connections. The 

m technology can deliver higher speed connections up to a range of 1 8,000 feet 

ffl 20 (5.5 km) from the CO. 

M A major advantage of DSL is that it uses the existing copper telephone 

wires to the customer locations. With 800 million phone lines deployed 
throughout the world, there is thus little need for new wiring to support DSL 
services. Many providers are offering a version of the DSL technology that is 
25 Asymmetric DSL (ADSL). 
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ADSL provides more bandwidth downstream for faster downloads where it 
is needed, than for uploads. ADSL Full Rate supports downstream speeds up to 
8 Mbps, and upstream rates up to 1 Mbps. FIG. 6 is a graph 280 illustrating an 
5 exemplary ADSL spectrum. FIG. 6 illustrates how the DMT (Discrete Multi-Tone) 
line code separates the phone line into three channels: up to 4 kHz channel 
reserved for voice, a 30 kHz to 138 kHz channel reserved for upstream traffic, 
and a 138 kHz to 1 .1 MHz channel reserved for downstream traffic. 

ADSL modems leverage signal processing techniques that insert and 
1 0 extract more digital data onto analog lines beyond the frequencies of normal 
voice services. Because the high frequency carrier signal can be modified, a 
larger digital data payload can be carried in the signal over greater distances 
using standard phone lines. 

In order to separate the regular voice service from the ADSL service, the 
15 splitters 214, 240 are used at the CO 202 and customer premise 204. When an 
ADSL transmission is received at the CO 202, the central office POTS splitter 
214 sends the voice traffic to the voice switch 222 and the data traffic to the 
DSLAM 216 and to the data network 212 and repository server 220. 

There are two types of DSLAMs usable with the system 200: a CO 
20 DSLAM, which is built for high density and concentration, and a remote DSLAM 
that is in a digital loop carrier (DLC) system in neighborhoods and office parks. 

The video content is acquired by the server 220 from the content 
repository 221 . The content repository 221 may be located at the CO 202 or in a 
different location. In this example, the video server 220 is connected to the ATM 
25 switch 218 that is linked to an ADSL DSLAM 216. 

In response to customer requests from the set-top box 248, video content 
is sent to the set-top box 248 from the repository 221 , and after recomposing, 
decompressing and decoding by the set-top box 248, the video is displayed on 
the TV monitor 250. 

30 
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FIG. 7 is a flowchart 300 illustrating operation of the central office shown 
in FIG. 5. In step 302, content to be stored in the repository 221 is compressed 
using a predetermined compression technique. Some of the compression 
techniques usable with the system 200 are described in further detail in 

connection with FIGS. 9-14. The compression algorithm can be executed by the 
server 220 or any other device for generating compressed content. 

In step 304, the compressed content files are decomposed into LQ and 
AQ portions. These portions are then stored in the repository 221 . 

Next, in step 306, one or more LQ portions are downloaded to the set-top 
box 248. These portions can be downloaded by the server 220 during off-peak 
hours, when the network usage is low. 

In step 308, a subscriber selection from the set-top box 248 is received at 
the server 220. The subscriber selection can be a message identifying a 
particular content file. The subscriber selection can be generated by a user 
interface (not shown) included in the set-top box 248. The user interface can 
present a menu on the TV monitor 250 that lists content files, such as movies, 
and permits the subscriber to make selections therefrom. 

In response to receiving the subscriber selection, the server 220 retrieves 

a corresponding AQ portion from the content repository 221 and then downloads 
the AQ portion to the set-top box 248 for real-time presentation. 

FIG. 8 is a flowchart 350 illustrating operation of the set-top box 248 
shown in FIG. 6. In step 352, LQ portions downloaded from the repository 221 
are stored locally within the set-top box 248. Next, in step 354, a subscriber 
selection is received by the set-top box 248. If the selection corresponds to a 
content file having a locally stored LQ portion, the selection request is then 
transferred to the server 220. In response to receiving a selection request, the 
server 220 begins streaming the AQ portion to the set-top box 248. 



- 12- 



- 13- 



Upon receiving the AQ portion, the set-top box 248 recomposes the LQ 
and AQ portions, as described in connection with FIGS. 2-4 (step 358). In step 
360, the set-top box 248 decodes the recomposed content. The decoded 
5 content is then transferred to the TV monitor 250 for real-time display (step 362). 

In order to create and decompose content transferred by the system 200, 
the content is compressed using a compression technique. The present 
invention is not dependent on any particular compression technique, and the 
following paragraphs describe examples of compression techniques that can be 
1 0 used with the video over ADSL transmission system 200, as well as the system 
20 of FIG. 1. 

Video content delivered by the system 200 is compressed using a video 
compression technique. Many conventional video compression techniques can 
be used to decompose the video into LQ and AQ parts. Three examples of 
15 compression techniques are described below. These techniques are discrete 
cosine transform (DCT) based algorithms, subband/wavelets-based algorithms, 
and vector quantization (VQ) algorithms. 

FIG. 9 is a process chart 400 illustrating a DCT algorithm for compressing 
content. There are a variety of techniques for compressing still and moving 
20 images (videos). One of the most popular compression technologies is 

transform-based. In a transform-based compression technology, an image is 
transformed using a transformation matrix, then quantized and transmitted. At 
the receiver, the data is de-quantized and an inverse transformation is applied to 
reconstruct the original content. DCT has been one of the most popular 
25 transformation techniques used in almost all standards-based compression 
technologies such as JPEG, MPEG-1, MPEG-2, MPEG-4, H.261, and H.263. 
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DCT is used on small blocks of pixels, usually 8x8 or 16x16 pixels. In 
most cases the energy of the video/image signal is concentrated in the lower 
frequency zone, and the higher frequency coefficients represent information on 
5 edges, transitions, and other finer details of the content. It is possible to 
reconstruct the original content by only a few DCT coefficients from the lower 
frequency area of the DCT matrix, and by disregarding coefficients in the higher 
frequency area. 

FIG. 9 describes the application of DCT on a still image. The image is first 
10 converted from an RGB format 402 into a Luminance (Y) and Chrominance (Cr, 
Cb) frames 404. The chrominance frames can be sub-sampled because color 
pixels are highly correlated. The Y and Cr and Cb frames are divided into small 
i blocks of 8x8, 1 6x1 6 or 32x32 pixels 405. The DCT is applied on the 8x8-pixel 

1 blocks followed by a quantization step 403. Then the DCT coefficients are 

1 5 arranged in a Zigzag manner 406, and higher frequency DCT coefficients are 
j| disregarded. DPCM, Run Length Encoding (RLE) and Huffman or arithmetic 

m coding 408 are used before transmitting the bits through a transmission channel. 

O This is the basic structure of JPEG. 

m For video, because of temporal correlation between frames, a search for a 

Jfj 20 motion vector is performed for each block in the previous frame. If a match is 

ii j 

U found, only a motion vector is transmitted. For blocks that don't have associated 

motion vectors, the DCT is computed then quantized and coded using a coding 
scheme and transmitted to the decoder. There are many variants of DCT-based 
encoding and ail major ITU and ISO standards such as JPEG, MPEG and H.26x 
25 are all based on block-based DCT compression technique. 
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To apply a video DCT-based approach to this invention, all motion vectors 
along with the low frequency DCT coefficients are sent to the client and stored in 
the local storage device. This information will be the LQ part of the video and 

5 represent a relatively lower quality version of the video. Then after the user has 
made the selection of a video, more DCT coefficients corresponding to the higher 
spatial frequency zone in each block will be sent to the client as the AQ portion of 
the video. In case of network congestion the lower quality version of the video 
will be displayed until the congestion is cleared. 

1 0 When dealing with moving images or video another important technique to 

reduce the bit rate that can be used along with DCT compression is motion 
compensation. The images are converted to YCrCb space, and the two 
chrominance channels (Cr and Cb) are decimated further to reduce the pixel 
count. In natural images, higher compression in the chrominance space does 

1 5 not results in noticeable visual artifacts and this is the reason the chrominance 
signals are down-sampled. 

The first step in compressing moving images is to predict motion from 
frame to frame in the temporal direction, and then to use DCT to organize the 
redundancy in the spatial directions. The DCT is applied on image blocks of 8x8 

20 pixels in size, and the motion prediction is done in the luminance (Y) channel on 
16x1 6-pixel blocks. In other words, given the 16x16 block in the current frame 
that is being compressed, a search for a close match to that block in a previous 
or future frame (there are backward prediction modes where later frames are 
sent first to allow interpolating between frames) is performed. The DCT 

25 coefficients (of either the actual data, or the difference between this block and the 
close match) are "quantized", which means that higher frequency DCT 
coefficients are disregarded. During the quantization process, many of the DCT 
coefficients will end up being zero. The quantization can change for every 
"macroblock" (a macroblock is 16x16 of Y and the corresponding 8x8's in both U 

30 
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and V). The results of all of this, which include the DCT coefficients, the motion 
vectors, and the quantization parameters is Huffman coded using fixed tables. 
The DCT coefficients have a special Huffman table that is "two-dimensional" in 
5 that one code specifies a run-length of zeros and the non-zero value that ended 
the run. Also, the motion vectors and the DC DCT components are DPCM 
(subtracted from the last one) coded. The above describes the basic structure of 
MPEG-1 and MPEG-2 compression standards. 

Standard MPEG-2 compression technology can be used in an efficient 
1 0 manner to compress the content and decompose the compressed data into LQ 
and AQ partitioning. FIG. 10 is a process diagram 500 illustrating the 
decomposition of video content using an MPEG-2 compression algorithm. The 
i video file 502 is provided to an MPEG-2 codec 504 to generate a compressed 

1 video output. The compressed video is then classified 506, or decomposed, into 

15 LQ and AQ parts. The LQ part 508 of the compressed content can contain all 

W motion vectors and low frequency DCT coefficients. The AQ portion 510 can 

7 include higher frequency DCT coefficients. The number of DCT coefficient 

% included in the AQ portion 51 0 depends on the targeted type of ADSL connection 

J| and the minimum bandwidth available. 

□ 20 In the system 200, the process 500 can be implemented in software 

executed by the server 220. The server 220 can be a PC running a conventional 
operating system and including networking cards and software for interfacing to 
the ATM switch 218 and repository 221 . 

FIG. 11 is an exemplary DCT coefficient map 450. The DCT block 
25 corresponds to an image block of 8x8 pixels. In this example, LQ for each frame 
of video consists of all motion vectors and the DCT coefficients from 1-10. The 
LQ part of the content is sent ahead of time and the different AQs are sent in 
real-time to ADSL connections with different bandwidth. For example, an ADSL 
connection with a bandwidth of 1 .5 Mbps would receive the DCT coefficients 

30 
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1-10 and additional DCT coefficients 11-15. Another ADSL connection with a 
bandwidth of 3 Mbps would receive DCT coefficients 1-15, as well as additional 
DCT coefficients 16-21. An ADSL connection with 6 Mbps of bandwidth would 
5 receive DCT coefficients 1-36. 

With MPEG-2, the set-top box 248 performs the function of reorganizing 
LQ and AQ first, and then a standard MPEG-2 decoder can be used to 
decompress the video and send it to the TV monitor 250. Reorganizing the data 
can be performed either in software or hardware. 

10 Another compression technique that can be used in the systems 20 and 

20 is sub-band coding and wavelets. With conventional sub-band coding and 
wavelet techniques, a signal spectrum is partitioned into several frequency 
bands. Each band is encoded and transmitted separately. The wavelet 
approach is a form of sub-band coding, and hence, for brevity, only sub-band 

15 coding is discussed in detail below. 

In sub-band coding, an image is first filtered to create a set of images, 
each of which contains a limited range of spatial frequencies. These images are 
called the sub-bands. Since each sub-band has a reduced bandwidth compared 
to the original full-band image, they may be down sampled. This process of 

20 filtering and sub-sampling is termed the analysis stage. The sub-bands are then 
encoded using one or more coding schemes. Different bit rates and different 
compression techniques can be used to adaptively and optimally encode the 
video. 

Reconstruction at the client is achieved by up sampling the decoded 
25 sub-bands, applying the appropriate filters, and adding the reconstructed 
sub-bands together. This stage of the process is called synthesis stage. 

The application of sub-band/wavelets to this invention is to transmit the 
lower spatial frequency sub-bands first as the LQ part of the video, and then after 
the user has made a selection, to send higher spatial frequency sub-bands as 
30 the AQ part of the video. 
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FIG. 12 is a graph 550 showing an exemplary signal spectrum for a 
sub-band coding algorithm. FIG. 12 shows the transfer function for a typical 
sub-band filter. An ideal sub-band filter is made of a bank of bandpass filters, 
5 which are contiguous in frequency, zero attenuation in the pass-band and infinite 
attenuation in stop band. When applying the bank of filters to an image or video 

signal, a series of filtered images are obtained. The bandpass filters reduce the 

bandwidth of each one of these filtered images. According to Nyquist criteria, the 

smaller size images obtained from this phase of filtering can be downsampled at 
1 0 a lower rate. In sub-band coding the combination of bandpass filtering and 

downsampling is called decimation. The inverse process that allows 

reconstructing the original signal is called Interpolation. 
=§ FIG. 13 is a conceptual decomposition map 600 of an image frame that 

'£ has been decomposed into seven sub-bands. In FIG. 13, sub-bands 1 ,2,3 and 4 

W 1 5 represent most of the information contained in the image (lower spatial frequency 
jj information). The remaining sub-bands represent sharp transition, edges of 

m object and other higher spatial frequency information. The lower sub-bands can 

Q be included in the LQ portion of a decompose file, while the higher sub-bands 

m can make up the AQ portion. 

!fJ 20 Sub-band decomposition is well suited to the video over ADSL 

i* transmission system 200. In one embodiment of the system 200, sub-bands 1 -2 

of FIG. 13 can be regarded as the core information (LQ) for each frame and can 
be downloaded in off-peak hours to the set-top box 248. The remaining bands 
are part of AQ and can selectively be sent to the set top box 248 on demand, 
25 based on he ADSL bandwidth available. 
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Commercially-available encoder and decoder chip sets for sub-band 
coding can be included in the CO 202 and set-top box 248, respectively, to 
implement a sub-band coding approach. In order to reconstruct the original 
5 content at the set-top box 248, the LQ part of the content that is already 

downloaded in the set-top box 248 is combined with the AQ that corresponds to 
the higher spatial frequency sub-bands and is decoded to reproduce the original 
content. 

FIG. 14 is a block diagram 650 illustrating vector quantization (VQ) 
10 processes for encoding and decoding content. The encoding process 652 can 
take place at the CO 202 to generate compressed, decomposed content that is 
stored in the repository 221 . The decoding process 654 takes place at the 
set-top box 248 to re-compose the content for presentation. 

Digital signals can be compressed by assigning shorter code words to 
1 5 more probable signals and that the maximum achievable compression could be 
determined from a statistical description of the signal. Coding vectors or groups 
of symbols (pixels in images, speech samples in voice, etc.) is more efficient than 
coding individual symbols or samples. Vector quantization is based on these 
principles and has been used in speech, image, and video coding. 
20 A vector quantizer Q of dimension k and size N is a mapping of a vector in 

a K dimensional Euclidean space R k into a finite set "C" containing N outputs or 
reproduction points called "codevectors" or "codewords". 



Q: R k -9 C (3) 

25 

Where C = (Y 1f Y 2 , Y 3 , . . ., Y,, . . ., Y n ), and Y, are elements of R k for each 

i = {1,2,3 N}. 

The set "C" is called the codebook 658. 
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ln video compression, VQ is used along with motion compensation to 
reduce the bit rate. Typically VQ is applied on a single video frame that is divided 
into small blocks of 4x4 pixels. For each block of 4x4 pixels, a search is 
5 performed in a codebook to obtain the closest match possible. The can be 

accomplished using a nearest neighbor rule 656. The codebook 658 consists of 
different patterns of 4x4-pixel blocks with different colors and textures. 
Codebooks are formed using clustering algorithms and that are applied on a 
large set of example content. 
1 0 Each entry in the codebook is assigned an index. After finding the closest 

match to a block, the index 660 corresponding to the matching block from the 
codebook 658 is transmitted to the decoder 654 through a transmission network 
I 662. The decoder 654 has a copy 668 of the same codebook 658, and after 

I decoding 664 the index, it performs a table look up 667 in the codebook 668 and 

V 1 5 retrieves the matching block that represents the original content. 
I The larger the codebook the higher is the video quality. However, larger 

- codebooks result also in higher bit rates. 

3 There are different ways that VQ can be included in the systems 20 and 

| 200. One approach is to decompose video content information into Luminance 

5 20 (Y) and motion vectors, and Chrominance (Cr, Cb). The LQ would be the 

combination of Y and motion vectors, and AQ would consist of the chrominance 
information. The approach can use different codebooks for Y and Cr and Cb. 
Alternative approaches rely on the segmentation of the codebook in such a way 
that a low quality version of the content is created as LQ and residual information 
25 is stored as AQ. 
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While specific embodiments of the present invention have been shown 
and described, it will be apparent to those skilled in the art that the disclosed 
invention may be modified in numerous ways and may assume many 
5 embodiments other than those specifically set out and described above. 

Accordingly, the scope of the invention is indicated in the appended claims, and 
all changes that come within the meaning and range of equivalents are intended 
to be embraced therein. 



-21 - 



